Word Knowledge Acquisition for Computational Lexicon Construction

نویسندگان

  • Thatsanee Charoenporn
  • Canasai Kruengkrai
  • Thanaruk Theeramunkong
  • Virach Sornlertlamvanich
  • Hitoshi Isahara
چکیده

The growing of multilingual information processing technology has created the need of linguistic resources, especially lexical database. Many attempts were put to alter the traditional dictionary to computational dictionary, or widely named as computational lexicon. TCL’s Computational Lexicon (TCLLEX) is a recent development of a large-scale Thai Lexicon, which aims to serve as a fundamental linguistic resource for natural language processing research. We design either terminology or ontology for structuring the lexicon based on the idea of computability and reusability.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Whole word morphologizer: expanding the word-based lexicon: a nonstochastic computational approach.

Whole Word Morphologizer is a small computer implementation of word-based morphology. The program automatically identifies morphological relations in a small word-based lexicon, literally learning its morphology, and uses the knowledge it acquires to generate new words. It is based on a model of the mental lexicon in which all entries are whole, entire, fully fledged words and relies solely on ...

متن کامل

A Case-Based Approach to Knowledge Acquisition for Domain-Specific Sentence Analysis

This paper describes a case-based approach to knowledge acquisition for natural language systems that simultaneously learns part of speech, word sense, and concept activation knowledge for all open class words in a corpus. The parser begins with a lexicon of function words and creates a case base of context-sensitive word definitions during a humansupervised training phase. Then, given an unkno...

متن کامل

Lexical Knowledge Acquisition from Corpora

The paper presents a computational environment to support developing a lexicon for natural language processing. The underlying idea of the environment is to utilize up-to-date language technologies to minimize both the human labor and the inconsistency that are unavoidable in manual compilation of a lexicon. The proposed computational environment enables an efcient construction of a consistent ...

متن کامل

Word Maturity: Computational Modeling of Word Knowledge

While computational estimation of difficulty of words in the lexicon is useful in many educational and assessment applications, the concept of scalar word difficulty and current corpus-based methods for its estimation are inadequate. We propose a new paradigm called word meaning maturity which tracks the degree of knowledge of each word at different stages of language learning. We present a com...

متن کامل

The Self-Extending Phrasal Lexicon

Lexical representation so far has not been extensively investigated in regard to language acquisition. Existing computational linguistic systems assume that text analysis and generation take place in conditions of complete lexical knowledge. That is, no unknown elements are encountered in processing text. It turns out however, that productive as well as non-productive word combinations require ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006